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We present a bigraphical framework suited for modeling biological systems both at protein level and 
at membrane level. We characterize formally bigraphs corresponding to biologically meaningful sys- 
tems, and bigraphic rewriting rules representing biologically admissible interactions. At the protein 
level, these bigraphic reactive systems correspond exactly to systems of K"-calculus. Membrane-level 
interactions are represented by just two general rules, whose application can be triggered by protein- 
level interactions in a well-defined and precise way. 

This framework can be used to compare and merge models at different abstraction levels; in 
particular, higher-level (e.g. mobility) activities can be given a formal biological justification in terms 
of low-level (i.e., protein) interactions. As examples, we formalize in our framework the vesiculation 
and the phagocytosis processes. 

1 Introduction 

Cardelli in [8] has convincingly argued that the various biochemical toolkits identified by biologists 
can be described as a hierarchy of abstract machines, each of which can be modelled using methods and 
techniques from concurrency theory. These machines are highly interdependent: "to understand the func- 
tioning of a cell, one must understand also how the various machines interact" [8]. Like other complex 
situations, it seems unlikely to find a single notation covering all aspects of a whole organism. In fact, we 
are in presence of a tower of models [19], each focusing on specific aspects of the biological system, at 
different levels of abstractions. Higher-level models must be represented, or realised, at a lower level, and 
where possible this representation must be proved sound; in addition, we need to combine different mod- 
els at the same level. To this end, we need a general metomodel, that is, & framework, where these models 
(possibly at different abstraction levels) can be encoded, and their interactions can be formally described. 

In this paper, we substantiate Milner's idea that bigraphs can be successfully used as a framework 
for systems biology. More precisely, we define a class of biological bigraphs, and biological bigraphical 
reactive systems (BioRS), for dealing with both protein-level and membrane-level interactions. 

An important design choice is that this framework has to be biologically sound, i.e., it must admit 
only systems and reactions which are biologically meaningful, especially at lower level machines (i.e. 
protein). In this way, encoding a given model, for any abstract machine, as a BioRS provides automat- 
ically a formal, biologically sound justification for the model (or "implementation") in terms of protein 
reactions and explains how its membrane-level interactions are realised by protein machinery. 

In order to formalize this "biological soundness", we need a. formal protein model to compare to 
our framework. We choose Danos and Laneve's K-calculus, one of the most accepted formal model of 
protein systems. By suitable sorting conditions, we define a bigraphical framework which allows all and 
only protein configurations and interactions of the ff-calculus. It is important to notice, however, that our 
methodology is general, and can be applied to other formal protein models. 

On the other hand, membrane nesting reconfiguration can be performed by just only two general 
rules, corresponding to the natural phenomena of "pinch" and "fuse" [10]. For encoding a given mem- 
brane model one has just to refine this general schema by specifying when these reactions are triggered, 
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that is, when the right proteins are in place. Indeed, as observed by biologists (see [1] and [2, Ch. 15]), 
membrane interactions present always a "preparation phase", where membrane proteins (receptors and 
ligands) interact, followed by the actual "membrane reconfiguration" phase. The "pinch" and "fuse" 
rules model the latter phase, whilst the preparation phase depends on the specific proteins involved and 
hence left to the encoding of the specific model under examination. 

Another important consequence of encoding a model in our framework is that any "too abstract", 
or non-realistic, aspect of the model is readily identified, because its formalization turns out to be prob- 
lematic, or even impossible. In this case, one has to change the model, or the encoding, in order to 
be biologically sound. As a result, the bigraphical encoding of a given system may exhibit unexpected 
features, not present (or unpredicted) in the original model. Far from being a problem, this allows to 
foresee emerging properties, such as behaviours due to interactions of different abstraction levels, and 
which cannot be observed within a single machine model due to its intrinsic abstractions. 

A further motivation for our framework comes from the many general results provided by bigraphs. 
We mention here only the construction of compositional bisimilarities [18], allowing to prove that two 
systems are observational equivalent, that is, they can be exchanged in any organism without that the 
overall behaviour will change. Also for this application it is important to restrict the bigraphs allowed by 
the framework to only those biologically meaningful. 

We summarise briefly our approach. In Section 3 we first define protein link graphs and correspond- 
ing reactive systems, which correspond precisely to protein solutions and protein transition systems of 
the K"-calculus (recalled in Section 2.2). In Section 4 we extend this approach to deal with compart- 
ments, introducing biobigraphs. Membranes are represented by two new nodes, which play no role at 
the protein level, but which can contain other systems. Appropriate sorting conditions will enforce bio- 
logical properties such as bitonality and orientation of membranes. Two applications of this frameworks 
are given in Section 5: a formal description of vesicle formation process, and a formalization of the Fc 
receptor-mediated phagocytosis. In both cases, the framework obliges to provide a formal justification 
of (membrane mobility) reactions in terms of protein-level interactions. Conclusions are in Section 6. 



2 Preliminaries 

2.1 Bigraphical reactive systems 

In this section we recall the bigraphical framework [18], extending the variant of [5] with typed names. 

Bigraphs A bigraph represents an open system, so it has an inner and an outer interface to "interact" 
with subsystems and the surrounding environment. The numbers in the interfaces describe the roots in 
the outer interface (that is, the various locations where the nodes live) and the sites in the inner interface 
(that is, the grey holes where other bigraphs can be inserted). On the other hand, the names in the 
interfaces describes the open links, that is end points where links from the outside world can be pasted, 
creating new links among nodes. An example of a bigraph is shown in Fig. 1 . 

A signature is a quadruple (Jf,ar, 2? where is a set of node controls with an arity function 
ar : — > N, ST is a poset of types with ordering C, and ^C^ denotes the edge types. 

Definition 2.1 (Interfaces and Bigraphs). An interface is a pair (n,X), where n is a finite ordinal (called 
width) and X is a set of typed names x: t, where t G ST. 

A bigraph G is composed by a place graph G p , describing the nesting of nodes, and a link graph G L , 
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inner name 



place = root or node or site Mnk = edge or outer name 

point = port or inner name 

Figure 1: Example of a bigraph and its structure (picture from Milner's slide). 

describing the (hyper-)links among nodes. 

G p = (V, ctrl,prnt) : m^n (Place graph) 

G L = (V,E, Ctrl, edge, link) : X — > Y (Link graph) 

G = (G P ,G L ) = (V,E, ctrl, edge, prnt, link) : (m,X) -> (n,Y) (Bigraph) 

where V,E are the sets of nodes and edges respectively, ctrl : V — ► is the control map, which assigns 
a control to each node, edge: E — > S{C. 3F) is the edge typing which assigns a type to each edge, 
prnt: mHJV —> V fcfcln is the (acyclic) parent map, the disjoint sum /Vt = L v ev ar ( c ^( v )) * s tne set °f 
ports (associated to all nodes), and Zmfc: X\±lPrt — >El±iy is the (type consistent) Zmfc map, i.e., for all 
x: t G X, if iinife(x: t) = l:t' then t C t'. 

In the following, we say place to mean a node, a root or a site, and we call the elements in the domain 
of the link map points, while the ones in the codomain links. An idle name is not a target of any point. 
We denote by (v, i) the z'th port of the node v. Two nodes are peer if two of their port (one per node) 
have the same target link. Often, we say control to mean a node with associate that control, and we write 
x G X in place ofx:t£X when the type of the name it is clear from the context. 

Definition 2.2 (Bigraph category). Given a signature (J(T,ar, 3T,£), the category of (link typed) bi- 
graphs Big(J^, , S) has interfaces as objects and bigraphs as morphisms. 

Given two bigraphs G : (m,X) —>■ (n,Y), H : (n, Y) — > (k,Z), the composition HoG: (m,X) — > (k,Z) 
is defined by composing their place and link graphs: 

Place: H p o G p = (V, ctrl, (idv G U prntu) ° (prntc U /c/y„ ) ) : m —> k; 

Link: // L oG L = (V,E, ctrl, edge, (id£ G Ulinkn) ° (linkcUidp H )) : X —> Z, 

where V = Vg U V}/, £ = Eg U Ei/ (supposing Vg H V# = Eg H E# = otherwise a renaming is applied), 
ctfrZ = cfr/c U ctrln and et/^e = edgeQ U edgen- 

Plg and Lnk will denote the categories of place and link graphs, respectively. 

An example of splitting of a bigraph in its components is shown in Figure 2. 

Intuitively, composition is performed in two steps. First, the place graph are composed by putting the 
roots of the "lower" bigraph inside the sites (i.e., holes) of the "upper" one, respecting the order given by 
the ordinal in the interface; then, the links are connected by sticking together the end parts of connections 
in the two link graphs, labelled with the same typed name in the common interface. 
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sites... 12 xq xi ... inner names 



Figure 2: Splitting of a bigraph (picture from [18]), where m = 2, n = 1, X = {xq,xi}, Y = {yo,yi}. 

An important operation on bigraphs is the tensor product. Intuitively, the tensor product of G : 
(m,X) -> (n,Y) and H : (m',X') -» <n',F'), defined when XnX' = Y C\Y' = 0, is the bigraph G<g>H : 
(m + m',X\SX') — > (n + n', Y l+J y') and is obtained by putting "side by side" G and //. Two useful variant 
of tensor product can be defined using tensor and composition: the parallel product ||, which merges 
shared names between two bigraphs, and the prime product |, that moreover merges all roots in a single 
one. We refer to [18] for formal definitions. 

Bigraphical reactive systems (BRSs) are reactive systems in the sense of [17], built over the category 
BlG( J£^, , £) [18, 5]. A BRS consists of a set of bigraphical rewriting rules, together with a definition 
of the evaluation contexts (i.e., bigraphs with holes) where rules redexes can be found in order to be 
rewritten. Such contexts are defined as a sub-category srf of active context inside Big(J?T, $). 

Definition 2.3 (Activity). Each control of a signature is either active or passive or atomic. An atomic 
control cannot contain any node (hence it is a leaf on the place graph). A location (i.e., an element in the 
domain of prnt) is active if all its ancestors have active controls (roots are always active); otherwise it is 
passive. A context is active if all its holes are active locations. 

It is easy to check that active contexts form a compositional reflective sub-category of bigraphs, 
which we denote by srf , the category of active contexts. 

Definition 2.4 (BRS). A bigraphical reactive system S>(jf, is formed by BlGpf \^,£) 
equipped with the subcategory ACT of active contexts, and a set ffl of (parametric) reaction rules, that is 
pairs L,R : (m,X) — > (n,Y) (usually written as L -> R). The reaction relation -> defined by a BRS is the 
relation between ground bigraphs given by the following rule 1 : 

CGACT L^Re£g Jground G = CoLod H = CoRod 

G^H 

Clearly, these definitions can be applied to place and link graphs too. Active contexts for place graphs 
are as for bigraphs and the rules can have only holes; instead for link graphs, all contexts are active. 

Sortings Usually, bigraphs defined over a given signature are "too many". A systematic way for ruling 
out unwanted bigraphs is by means of a sorting [4], that is, a functor F : S — > BIG, faithful and surjective 
on objects, defined on a category S where unwanted morphisms from BIG are deleted. A sorting can be 

'Milner's definition of BRS allows for non-linear instantiations, but for the scope of this paper linear rules are sufficient. 
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conveniently defined by means of a suitable logical condition specifying the class of morphisms we want 
to restrict to [4]. To ease readability, in the paper we will present these conditions in a semi-formal logical 
language; a formal description of all these conditions can be given as BiLog formulas [12]. Notice that 
also the name typing introduced in Definition 2. 1 can be expressed by means of a sorting. 

2.2 The K-calculus 

Here we recall Danos and Laneve's fc-calculus [14], a calculus for protein interactions, which will be our 
reference protein-level model in Section 3. 

Let 2? be a finite set of proteins and jY an enumerable set of edge names. Let s : — > N be a 
signature, that assigns to every protein the number of its domain sites. A fc-interface is a map from N 
to jV l±l {h, v} (ranged over p, a, ...). Given an interface p and a protein name A G 2 s , a site (A, i) is 
visible if p(i) = v, hidden if p(i) = h, and tied if p(i) G JY . A site is free if it is visible or hidden. In 
the following, we will write p = 1 +2 + 3 X to mean p(l) = v, p(2) = h, and p(3) = x. The syntax of 
fc-solutions is the following: 

S,T::=0\A(p)\S,T\(x)(S) 

up-to a -equivalence and the least structural equivalence (=), satisfying the abelian monoid on compo- 
sition and the scope extension and extrusion laws. The sets of free edge names fn{p) and fn(S), on 
interfaces and solutions, are defined as usual. 

Definition 2.5. The set of connected K-solutions is defined inductively as: 

S ST fn(S)nfn(T)^<d S S = T 
A(p) JxjS S\f " T 

Definition 2.6. A fc-solution S is (strong) graph-like if free names occur at most (exactly) two times in 
S, and binders tie either or 2 occurrences. 

A complex is a closed, graph-like, and connected solution. 

Biochemical reactions are either complexations (i.e. where low energy bonds are formed on two com- 
plementary sites) or decomplexations, possibly co-occurring with (de)activation of sites. Causality does 
not allow simultaneous complexations and decomplexations on the same site: this biological constraint 
is assured in the fc-calculus by the notion of (anti-)monotonicity, which forces reactions not to decrease 
(increase) the level of connection of a solution (see [14] for details). 
Definition 2.7 (Growing relation). Let x be a set of fresh names. 

A growing relation > over ^-interfaces is defined inductively as follows: 

iEi Jcn/n(p)=0 xhp>(7 xhp'oa' 
x\-i>i xhi\>i xhi>i x xhpi>p ihp+p'>0 + ff' 

A growing relation > over fc-solutions is defined inductively as follows: 

xhS\>T xhp>a xhS>T fn(o)Cx 
xhO>0 xh S,A(p)>T ,A(o) x^S>T,A(o) 

Definition 2.8 (Monotone reactions). Let L,R be two K"-solutions. L — > (x)R is a monotone reaction 
if x h L > R, L and (x)R are graph-like and R is connected. L — > (x)R is an antimonotone reaction if 
R — > (x)L is monotone. 

Finally, we are able to characterize a protein transition system: 
Definition 2.9 (Protein transition systems). Given a set 3% K of monotone and antimonotone reactions, 
we define a protein transition system (PTS) as a pair such that ^ is a set of K"-solutions, and ^ 

is the least binary relation over 5? such that & K C closed w.r.t. =, composition and name restriction. 
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S:{x} 




X 



s: 0-> {*} 

HIDDEN P 

Figure 3: An example of a wrong composition. 

3 Protein link graphs 

In this section we introduce a general graphical formalism for modelling biological protein interactions. 
Proteins are represented as atomic nodes where links stand for chemical bonds. In such a model locations 
are not needed, hence we do not make use of the full bigraphical framework and link graphs are enough. 

A protein changes its behaviour depending on its folding structure which determines its current inter- 
acting interface (the set of its domain sites). This structure could change after a complexation reaction. 
To describe the interface of each protein we use four linking types, v (visible), h (hidden), b (bond) and 
f (free), indicating the status of the corresponding protein site which they are connected to. 

Definition 3.1 (Protein signature). Let & be a set of protein controls, a protein signature (^P,ar) is 
defined as (^,ar, 3^ P ,S P ), where & p = {h,v,b,f} with [l p = id# p \J{(f,t) : t G 3f p }, and S = {h, v, b}. 

We say that a site is visible, hidden, bond or free if it is connected to an link of type v, h, b or f, 
respectively. In principle, hyperlinks can connect more than two ports, yielding biological meaningless 
link graphs because a bond involves at most two sites. Clearly, we want this condition to be preserved 
under composition (it is always preserved by tensor). Actually, two good protein solutions could be 
composed yielding an incorrect solution, as in the Figure 3. 

Well-formed link graphs can be defined as follows: 

Definition 3.2 (Protein solution). A protein solution S : X — > Y is a link graph over a protein signature 
SP, such that every link has at most two peers, and every link of type v or h has exactly one peer: 

Wl:t.t e{b,f}^\link~\l:t)\ G{1,2} and V/ :t.t G {h,v} => \UnkT l {l :t)\ = 1 (ProtSol) 

Definition 3.3 (Protein link graphs). The category of protein link graphs LGP is the category of link 
graphs sorted on the predicate ProtSol. 

Let us now consider possible reaction rules on link graphs. As for fc-calculus, reactions allow com- 
plexations only between two visible sites, hence tied sites must be freed before being involved in any 
protein interaction. Using the same approach of [14], protein reactions can either increase or decrease 
the level of connection of protein complexes; they can introduce new nodes (protein synthesis) or remove 
nodes (protein degradation). Formally, we define a growing relation: 

Definition 3.4 (Growing relation). The growing relation t> over pairs (ft), S) of wirings ft) : Z — > Y and 
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Figure 4: An example of a growing solution: (0,S>jJ.,T. 



discrete solutions S : X — > Z is defined inductively as 

(empty) 

(hide) 



co,St>co',S' 



id% , id® t> id® , id® 



/z:v<S>(a,S> /z:h<8)fl),5 



(D®n,S®T\>co'<giH,S'<giT 

/z:h(gico,S[> /z:v®(Q,S 
(tie) 



(tens) 
(reveal) 



/zi : v (g) /z 2 : v (g> a), 5 [> (/z: h o z /{ Zl , Z2 }) (gi oj, 5 
r:0^{w}kt)W /i:W->0 



/z: v (g> a>, 5 > (/z: h o (z/w ®id t ))®(0®^l,S®T ( synth ) 

The growing relation can be lifted to morphisms of LGP: G \> H iff co, G' \> pL , where (QoG' , jJ.oH' 
are the discrete decompositions of G and H, respectively. 

Let G\>H, according to (hide) and (reveal) H can toggle free sites in G from visible to hidden and 
vice versa, whereas (tie) allows H to bind only pairs of sites that are visible in G. The (synth) axiom 
adds new proteins in the solution H (possibly connected to proteins already in G) if their ports are all 
linked to edges in H. 

Note that the discrete decomposition of a protein link graph induces a separation among protein 
interfaces and protein nodes. An example is given in Fig. 4. 

Now we can introduce the notion of protein reactive systems, where rewriting rules respect causality. 
This is guaranteed by requiring rules to either grow or shrink solutions, representing complexations and 
decomplexations respectively. 

Definition 3.5 (Protein reactive systems). A rule (Z, r) on protein link graphs is monotone if l>r and r 
is connected; (r, I) instead is antimonotone. 

A protein reaction system (PRS) is a reactive system over LGP, whose all reaction rules are monotone 
or antimonotone. 

We establish now a formal correspondence between protein reactive systems and protein transition 
systems. We use the set of fc-protein names @* as the set of protein controls in LGP, using the corre- 
sponding K"-arity function s : 2? — > N. 

Now, given a setX of typed names, the encoding function (| • \x mapping K-solutions into LGP is 

mx=X <\(x)(S)h = /y--b°1S{y.b/x:t}\) X u{r.b} where ?G{f,b} 

<\s,Th = flsD* || <\Th <\A(p)h =x || (aGSMODS) °a Zi .,,., Zs(a) .,) 

where <\h\) x = /z:b Hx = /zv Ml =x: b/z: f (if xeX) 
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T T 



(x)(A(I + 2 + 3- v ) ,5(1* + 2 + 3^)) h 



2 3 



Figure 5: Encoding a K-solution in a protein link graph. 



Note that <\S\)x is defined only when fn{S) C X. Fig. 5 shows an example. 
We can prove the following correspondence results. 

Proposition 3.1 (Syntax). For any set of names X, S = T iff <\S\) X = <\T \) X . 
Moreover, for all X, the map (| • \) x is surjective over the homset Lgp(0,X). 

Definition 3.6 (fC-PRS). A k-PRS is a reactive system over LGP whose set of rules is defined as 
^ = {((KD/>i(/)>d r D/n(V)) I (7> r ) £&k] > where S% K is a fc-calculus protein transition system. 

Proposition 3.2. Every fC-PRS is a protein reactive system. 

Proof, (sketch) Just prove that growing relations on K-solutions and protein link graphs correspond. □ 
Theorem 3.1 (Semantics). L ~> (x)R iff (JL^ X -> ^ (x)R\) x , where xHX = 0. 

Proof. Follows from Proposition 3.1 and 3.2. □ 



4 Biological bigraphs 
4.1 Biobigraphs 

A cellular membrane is a closed surface that acts as a container for biochemical solutions. One of the 
most remarkable properties of biological membranes is that they form a two-dimensional fluid (a lipid 
bi-layer) in which hydrophobic proteins (often only their hydrophobic sub-unit) can be immersed and 
freely diffuse. As a consequence membranes are localities where protein nodes can reside in them. We 
will represent them as two nested nodes, m cys and m ext (for cytosolic and extra-cytosolic layers), so that 
membranes behaves as hydrophobic protein containers. Formally, the cellular membrane signature is 

^4{ m c y s : 0,m ext : 0} 

where both controls are active, since biological reaction can happen inside a cell. 

Membranes come in different kinds, distinguished mostly by the trans -membrane proteins embedded 
in them. Such proteins have a consistent orientation and can act on both sides of the membrane simul- 
taneously, propagating external signals inside the membrane-compartment (and vice versa). Membrane 
behaviour is determined by proteins embedded in them, hence membranes inherit the consistent orienta- 
tion of their trans-membrane proteins. In our model we express trans-membrane proteins as complexes 
formed by an hydrophobic unit linked to one or more hydrophilic units (see Fig. 6(b)), in order to make 
explicit the protein orientation and, at the same time, to formally explain the orientation of membranes 
at the level of proteins [8, 10]. 

Membranes themselves are impermeable containers and they do not allow large molecules (such as 
proteins) to simply traverse them. At the same time protein (de-)complexation cannot take place through 
the membrane, because complexations take place only if their protein domains are close to each other. 
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As a consequence, protein complexes like the one in Fig. 6(a) are not biologically plausible. In order 
to get a bigraphical category of only well-formed biological bigraphs, we rule out these morphisms by 
applying the following condition: 

if v, w peer and prnt(v) = prnt k (w), then -i(9 1 < i < j <k. ctrl(prnt l : (v)) ,ctrl{prnt J (w)) £ ff) (Imp) 

which says that two nodes cannot be linked if there are more than two occurrences of a membrane control 
along the path to the common ancestor. 

This is not the only well-formedness condition to be satisfied by a biological bigraph. In fact, with 
respect to the bi-layered membrane representation, we have to rule out other ill-formed bigraphs. A 
priori one could insert m ext directly into another m ext , obtaining an ill-formed membrane. Moreover we 
want hydrophilic (polar) proteins not to reside on membranes and, analogously, hydrophobic (apolar) 
proteins not to reside in water. Hence, we have to rule out systems violating the polarity constraint, by 
means of a sorting over the place graph. To this end, we assume that the protein signature is partitioned 
in polar and apolar stf controls; moreover, we say that m ext is polar and m cys is apolar. Then, the 
polarity constraint can be expressed by the two following predicates (controls p c , p m will be introduced 
and explained later): 

if ctrl(y) is polar, then prntiy) is apolar, or ctrl(prnt(v)) = p c (POLAR) 
if ctrl(v) is apolar, then prntiy) is polar, or ctrl(prnt(v)) = p m (Apolar) 

However the polarity condition does not forbid wrong systems as those shown in Fig. 7(a) and (c). 
To this end, we need also the following condition to ensure the correct nesting of membrane layers: 

if ctrl(v) = m ext then 3\v' s.t. ctrl{v) = m cys A prntiy') = v (2LAYER) 

The vice versa is not necessary, indeed the system in Fig. 7(b) is well formed. 

Let us now consider the dynamics of biological systems. As expected, we recover all protein reaction 
rules from PRSs but extended with locations, and requiring that proteins cannot change their position. 
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Figure 8: Mobility commitment rules: pinch and fuse. These rules preserve bitoncility (regions with same 
pattern have the same tonality). 



Moreover, taking into consideration membranes, our model must deal also with membrane-transport. 
Membrane reconfigurations are fully justified at the protein-level; for instance, extension of pseudopods 
during a phagocytosis is triggered by the reorganization of the actin cytoskeleton [1]. In our model the 
protein justification for membrane transport is ensured by splitting membrane reconfigurations into two 
steps: 

1 . The first step adds some special mobility controls, which describe the intent of executing an action 
(e.g. phagocytosis) and its direction (e.g. what is eating what). Moreover, these controls "freeze" 
the sub-systems, until the action will be completed. 

2. The second step performs effectively the action, operating on the place graph, changing the posi- 
tion of systems (e.g., one is moved inside the other), removing the mobility controls and adding or 
removing a double layer. 

In order to implement this scenario, we need two new kind of rules: (mobility) introduction rules, for the 
first step, and (mobility) commitment rules, for the second step, i.e., the membrane reconfiguration. 

Let us discuss the latter first. Actually, all membrane interactions can be reduced to pinching and 
fusing of membranes [10]. Hence, we add exactly two mobility commitment rules, namely pinch and 
fuse, depicted in Fig. 8. These rules use some special controls to describe which parts of the system 
should be moved from a location to another; these controls are defined in the following mobility signature 

J(±Jt P \JjZ F where JC P = {p c : 1, p m : 2, p d : 1} JC F 4 {f c : l,f m : 2,f d : 1} 

All mobility controls are polar, except for p m and f m ; moreover p d ,f m ,f d are atomic whereas p c , p m ,f c 
are passive, so that they temporally "freeze" the part of system that will be moved. Notice that the rules 
pinch and fuse are bitonal, in the sense of [7, 10]. Preservation of bitonality means that reactions must 
preserve the even/odd parity with which components are nested inside membranes (see Figure 8). 

Mobility controls just help a mobility reaction to be modeled, and there is not a biological reason to 
admit a mobility node to be peered with a protein one. Moreover, mobility controls make sense only if 
tripled-up as in Figure 8, hence we allow only these linkage combinations. In addition, mobility controls 
cannot be placed freely. For example, there is no biological motivation to put p c and p d in the same 
location, since no cell can ingest itself; similar observations are valid for fusions. Hence, in order to 
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(Dm) 



discipline these rules, we introduce a sorting to forbid illegal membrane reconfigurations: 

if link((v,i)) = link((w,j)) then (ctrl(v),ctrl(w) G 0>\j£/) V 

(ctrl(v) = p m A((ctrl(w) = p c A i = 0) V (ctrl(v) = p d Ai = 1))) V 

(ctrl(v) =f m A((ctrl(w) = f c Az=0) V (ctrl(v) = f d Ai = 1))) 

and if ctrl(v) G Jt then V/ G ar(ctrl(v)). Se. lin]C x {e) = {(v,z)} (MOBIL) 

if (cfrZ(v) = p c ,c?r/(w) = p d ) V {ctrl{v) = f c ,ctrl(w) = f d ) 

and u peered with v and w, then prntiy) / prntiw) 

iSctrl(v) G ^# then V/c > 0. ctrl{prnt k {v)) £ {p c , p m ,f c } (NoNESTlNG) 
if prntiy) = ^r^(w) and ctrl{prnt(v)) = f c then v = w and ctrl(y) = m ext (FUSE) 

Intuitively, MOBIL forbids that a mobility node peers with a protein node, forcing it to be linked with 
another mobility node of the same type (P or F). Dm disallows mobility controls p c , p d and f c , f d 
to reside in the same location. Note that Imp forces mobility controls triples to do not violate the bi- 
layer impermeability, consequently such triples are connected in a well-formed manner. NoNESTlNG 
guarantees that mobility controls cannot be nested, hence disallows ambiguities on how performs actions. 
Finally, FUSE forces the control f c to have only one child of type m ext , because fusion can take place 
only by two cells. 

We can now define biological bigraphs as the bigraphs which satisfy all the sorting conditions we 
have given so far. 

Definition 4.1 (Biological bigraphs). Given two sets SP^si of polar and apolar proteins, a biologi- 
cal signature is an extension of a protein signature and it is defined as J^e(^ , , s g/) = U 2? U U 
^C, ar, {h, v}) with the condition that all controls in U szf are atomic. 

The category of biological bigraphs, denoted by Bl0BG(J£B(^,.e/)), is the category of bigraphs 
over &/) sorted by the following predicate: 

ProtSol a Polar a Apolar a 2Layer a Imp a Mobil a Dir a NoNesting . 

Now, let us consider the mobility introduction rules. Even if membrane reconfigurations are of only 
two kinds (pinching and fusing), they can be triggered by distinct of protein interactions (indeed no a 
single kind of membrane internalization exists, e.g. pinocytosis and phagocytosis). Hence, how mobility 
controls are introduced is left to the modeler (possibly a biologist), so that he can distinguish between 
the variety of signaling events that cause a membrane transformation. However, not all introductions of 
mobility controls are allowed: we want introduction rules not to perform protein reactions. Formally: 

Definition 4.2 (Introduction rules). A mobility introduction rule is a reaction rule (L,R) over biobi- 
graphs such that 

• ctrl(V L ) n^f = and ctrl(V L ) n Us/) / 0; 

• L = (L' || L'j || L' 2 ) o (Lq II L" || 4'), where U h L'! of width 1 for i G {0, 1,2}, 

R = [L' || L\ || L> 2 ) o ((/ x ,y: h o ( p c || (p- | id,) || (p d | id,))) ® id z ) o (L» \\ L>> \\ I"), and 
every z G Z is the target link of two distinct ports or of a port of a node in L' 2 \ 

• L = (L' || L[ || L' 2 ) o (Lq (I L" || l! 2 r ), where L'^L'f of width 1 for i G {0, 1,2} and 

R = (L' Q || L\ || L> 2 ) o ((/ X ,y: h o (f_^ || (fm I idl ) || (f d I id,))) ® idz) o (L'> || L'l || L'i). 
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The first condition states that a membrane reconfiguration can be justified only by protein interactions 
and not by the presence of mobility controls. In both other conditions, we require that the left hand side 
of the rule can be split into two parts of width 3. (L' ' || L" \\ L![) specifies the proteins that effectively 
takes part in the mobility action, whilst (L' \\L[ \\L' 2 ) describes any protein not involved in the action but 
required to trigger it. The extra condition for pinch introduction guarantees that the related commitment 
rule always takes place, i.e., it does not stuck for not violating the impermeability condition. 

We ensure that no protein reactions can be performed, since the redex L and reactum R only differs 
from an occurrence of a mobility triple. In Section 5 we will give two examples of how mobility controls 
and introduction rules are used to model membrane transport. 

Now, we can define the notion of biological reactive systems. 
Definition 4.3 (BioRS). A biological reactive system (BioRS) is a reactive system over the category 
of biobigraphs BioBg(J£b(^, g?)) and equipped with rules 3%b = { pinch, fuse} U^", where 3% is a 
given set of protein and introduction rules. 

4.2 Relating biological bigraphical reactive systems 

In this section we establish formal connections between biobigraphs and protein link graphs and "mobility- 
only" bigraphs, by means of two functors; this allows to focus on one biological aspect per time. 

The protein functor F P : BioBg(J^j( J 2 , s/)) -> Lgp(^ U s/) forgets all the membrane and mobil- 
ity controls and polarity aspects of a system, "flattening" it to a protein solution. Formally: 

F P ((n,X))=X 

F P ((V,E, Ctrl, edge, prnt, link)) = {V 1 ,E' ,ctrl\ yl ,edge\ EI ,link\ dom( j ink )\ Prt ^ v \ vl) ) 

V' = {v G V | ctrl(v) <£tfU^} E' = E\ {link((v, i)) G E \ v G V \ V} 

On the other hand, the mobility functor F M - BioBg(J£b(^,.s/)) — > MBG deletes all proteins 
present in the systems and focuses only on the structural and mobility information. Formally, let MBG = 
BioBg(J£b(0,0)) be the category of "pure mobility biobigraphs", and define Fm as follows 

F M ({n,X)) = (n,X) 

F M ((V,E, ctrl, edge, prnt, link)) = (V' ,E' ,ctrl\ v , ,edge\ E > , prnt\ v , ,link\ dom(link )\ P( y\ v ,)) 

V' = {v G V | ctrl{v) U s/} E' = E\ {link{{v, i)) G E \ v G V \ V'} 

These functors allows to give a formal justification of membrane-level actions in terms of simpler 
interactions, as it is formalized in the next result. 

Theorem 4.1. Let @ be a BioRS over Bl0BG(^(^,i/)), with rules St. Let F P {9) be the protein 
reactive system over Lgv(^P\Js/) whose rules are F P (3$\ {pinch, fuse}). Let Fm(@) be the BioRS 
over MBG whose rules are Fm{S?). 

For all G,H ground bigraphs in BioBg(J*b(^,^)), if G -> H in 9 then 

. either F P (G) + F P (H) or F M (G) + F M (H); 

• if F P (G) / F P (H) then F P (G) -> F P (H) in V P {®)\ 

• if F M {G) + F M {H) then F M (G) -> F M (H) in F M (@). 

Intuitively, this theorem states that the protein and mobility functors "project" the execution traces 
of a biobigraphical system into LGP and MBG, respectively, as shown in the diagram in Figure 9. In the 
first case, only the interactions between proteins are observed, but not their structural effects; in the latter 
case, the evolution of mobility controls and positions of cell is observed, but not the protein interactions 
which actually cause these dynamics. 
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Figure 9: Mapping biobigraphical steps to either protein or mobility reactions. 

5 Examples 

5.1 Vesicles formation 

In this section we give a formal bigraphical description of the vesiculation process which is one of the 
most fundamental transport mechanisms in cells. 

Vesicle transport is used to translocate proteins and membranes from the endoplasmatic reticulum 
(ER) to the plasma membrane via the Golgi apparatus, or to ingest macromolecules by receptor-mediated 
endocytosis at the plasma membrane. Vesicles form by budding from membranes. Each bud has distinc- 
tive coat protein on cytosol surface, which shapes the membrane to form a bubble. The bud captures the 
correct molecules for outward transport by a selective bond with a cargo receptor attached through the 
double layer to the coat protein complex, which is lost after the budding completes. Here we consider 
the endocytosis at the plasma membrane where clathrin coated vesicles are formed. 

In order to represent this vesiculation process, we introduce the protein signature 

& v = {cargo: l,rec c >' v : 2,rec ext : 2,adpt: 2,clath: 1} and st, = {rec m : 2} 

and finally the corresponding reaction rules for their interactions (graphically depicted in Figure 10): 

(/x,y:v®/z,w,k: h) o ((cargo x | rec*f ) || rec™ || rec^f) — > (rec) 

(A:v®/£,z,w:h)o(cargo, | recff) || rec™ || cys^) 

(/y:h®/x,£:v)o(rec^ | adpt x> .) — > (/y:v<g)/fc:h)o(rec^ | adpt fey ) (adpt) 

jyz: vo (adpt^, | clathj — > /b: h o (adpt^ | clath^) (coat) 

/b,b': ho(rec^ s | adpt bb i | clathy) — > (/x,k:v (g> /&:h)o(rec^ s \ adpt xh , | clathfo/) (uncoat) 

The vesicle is spawn when the correct number of clathrin-adaptin-receptor-cargo molecule com- 
plexes is formed. Let (P-cmplx) n be the bigraph representing n complexes, defined as: 

P-cmplx = /x,y,z,w,k: ho ((cargo., | rec e x f \ id]) || (rec™ | id]) || (recgf | adpt W(t | clath^ | id\)) 

(P-cmplxf = (1 || 1 || 1) (P-cmplx) n+l = P-cmplxo {P-cmplx) n 

Then, the actual creation of membranes is formalized by a single mobility introduction rule, which is 
triggered in presence of the right number of complexes: 

(P-cmplx) n — > /x,y: h o (p' || p™ || P> d ) o (P-cmplx) n (P-intro) 

After the application of this rule, the vesiculation process is completed by the pinch rule. An execution 
trace of this process is graphically depicted in Fig. 11. 
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Figure 10: Rules for vesicles formation. 




Figure 11: Receptor-mediated endocytosis pathway (compare with [2, Fig. 15.19]). 
5.2 Fc receptor-mediated phagocytosis 

Phagocytosis is the process whereby cells engulf large particles. Phagocytosis is triggered by the inter- 
action of antibodies that cover the particle to be internalized with specific receptors on the surface of the 
phagocyte. Here we present a simplified example of signaling transduction pathway for FcR-mediated 
phagocytosis taken from [15]. Fc receptors bind to the Fc portion of immunoglobulins (IgG), which trig- 
ger cross-linking of FcR and, hence, the activation of their enzymatic sub-units. This initiates a variety 
of signals which lead through the reorganization of the actin cytoskeleton, and membrane remodeling, to 
the formation of the phagosome. Here we focus mainly on the modeling criteria by which we can encode 
as a BioRS a phagocytosis reaction using introduction and commitment rules. Due to lack of space we 
show only the mobility introduction rule; the design of protein rules is similar to that in Section5.1, and 
their definition can be easily recovered by the signaling pathway shown in Fig. 12. 
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Figure 12: Fc receptor-mediated phagocytosis of an IgG-opsonized particle. 



6 Conclusions 

In this paper we have presented a framework for protein and membrane interactions, based on Milner's 
bigraphs. Assuming the protein-level interactions as the fundamental machine, we have characterized 
formally a class of link graphs and their reactive systems which correspond exactly to protein solutions 
and protein transition systems definable in the ff-calculus [14]. Then we have extended our approach 
to the membrane machine, whose mobility activity can be rendered just by two fixed rules. As example 
applications of this framework, we have modelled the formation of vesicles, and the Fc receptor-mediated 
phagocytosis, giving a formal justification at protein-level of membrane-level actions. 

Related and future work. The idea of representing biological systems using bigraphs goes back to 
Cardelli and Milner [6, 18]. To our knowledge our framework is the first "foundational metamodel", 
adequate at the protein-level, and allowing to describe formally the relations between biological models 
at different abstraction levels, according to the "tower of models" vision. 

Damgaard et al. have developed the ^-calculus [13], an interesting language for modelling inter- 
actions of membranes and proteins. There are many relevant differences between biobigraphs and the 
'rf -calculus. First, the ^-calculus is focused on protein diffusion through "channels", and it does not 
allow for phagocytosis as the one described using biobigraphs in Section 5.2. Moreover, one of the 
aim of biobigraphs is to explain higher-level interactions with lower-level reactions, according to the 
"tower of models" vision, and to be adequate with respect to the ff-calculus. According to this vision, in 
biobigraphs membrane reconfigurations are logically diffent from protein reactions, but the connection 
between the two aspects is formally specified. In the ^-calculus, mobility and protein reactions are more 
intertwined, and a formal connection between ^-calculus and K"-calculus has not been investigated yet. 
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In fact, we can say that our approach is model-driven, that is, we have striven for an adequate (bi- 
graphical) model before defining any language. In fact, in [3] we present Bio/3, a new (meta-)language 
inspired and corresponding to the bigraphical framework presented in this paper, and covering both pro- 
tein and membrane aspects. This language can be given an efficient, ad hoc implementation. 

A stochastic version of bigraphical reactive systems has been introduced in [16], for which systems 
biology is indicated as the primary application. Our work is complementary, and orthogonal, to [16]; 
indeed, we can easily think of "stochastic biobigraphs", endowing rules with rates, for dealing also with 
quantitative aspects, like reaction speeds, concentration ratios, etc., and eventually also the gene machine, 
in the line of recent approaches [9, 1 1]. 

Another interesting aspect to investigate is the bisimilarity definable via the IPO construction on 
bigraphs [18], which is always a congruence. This can be useful e.g. in synthetic biology, for instance, 
to verify if a synthetic protein (or even a subsystem to be implanted) behaves as the natural one. 

References 

[1] A. Aderem and D. M. Underhill. Mechanisms of phagocytosis in macrophages. Annual Review of Immunol- 
ogy, 17(l):593-623, 1999. PMID: 10358769. 
[2] B. Alberts, D. Bray, K. Hopkin, A. Johnson, J. Lewis, M. Raff, K. Roberts, and R Walter. Essential Cell 

Biology, 2nd edition. Garland, 2003. 
[3] G. Bacci, D. Grohmann, and M. Miculan. A framework for protein and membrane interactions. In 

G. Ciobanu, editor, Proc. MeCBIC'09, EPTCS, 2009. 
[4] L. Birkedal, S. Debois, and T. T. Hildebrandt. On the construction of sorted reactive systems. In R van 

Breugel and M. Chechik, editors, Proc. CONCUR, LNCS 5201, pages 218-232. Springer, 2008. 
[5] M. Bundgaard and V. Sassone. Typed polyadic pi-calculus in bigraphs. In Proc. PPDP, pages 1-12. 2006. 
[6] L. Cardelli. Bioware languages. In K. S. J. Andrew Herbert, editor, Computer Systems: Theory, Technology, 

and Applications - A Tribute to Roger Needham, pages 59-65. Springer, 2004. 
[7] L. Cardelli. Brane calculi. In Proc. CMSB, LNCS 3082, pages 257-278. Springer, 2004. 
[8] L. Cardelli. Abstract machines of systems biology. T. Comp. Sys. Biology, 3737:145-168, 2005. 
[9] L. Cardelli. A process algebra master equation. In Proc. QEST, pages 219-226. IEEE, 2007. 
[10] L. Cardelli. Bitonal membrane systems: Interactions of biological membranes. Theoretical Computer Sci- 
ence, 404(l-2):5 - 18, 2008. Membrane Computing and Biologically Inspired Process Calculi. 
[11] F. Ciocchetta and J. Hillston. Process algebras in systems biology. In M. Bernardo, P. Degano, and G. Zavat- 

taro, editors, Proc. SFM, LNCS 5016, pages 265-312. Springer, 2008. 
[12] G. Conforti, D. Macedonio, and V. Sassone. Spatial logics for bigraphs. In L. Caires, G. R Italiano, L. Mon- 
teiro, C. Palamidessi, and M. Yung, editors, Proc. ICALP, volume 3580 of Lecture Notes in Computer Sci- 
ence, pages 766-778. Springer, 2005. 
[13] T. C. Damgaard, V. Danos, and J. Krivine. A language for the cell. Technical Report TR-2008-116, IT 

University of Copenhagen, Dec. 2008. 
[14] V. Danos and C. Laneve. Formal molecular biology. Theoretical Computer Science, 325, 2004. 
[15] E. Garcia-Garcia and C. Rosales. Signal transduction during fc receptor-mediated phagocytosis. J. Leukocyte 

Biology, 72(6): 1092-1 108, 2002. 
[16] J. Krivine, R. Milner, and A. Troina. Stochastic bigraphs. In Proc. 24th MFPS, ENTCS 218:73-96, 2008. 
[17] J. J. Leifer and R. Milner. Deriving bisimulation congruences for reactive systems. In C. Palamidessi, editor, 

Proc. CONCUR, volume 1877 of Lecture Notes in Computer Science, pages 243-258. Springer, 2000. 
[18] R. Milner. Pure bigraphs: Structure and dynamics. Information and Computation, 204(1):60-122, 2006. 
[19] R. Milner. The tower of informatic models. In From semantics to Computer Science; Essays in Memory of 
Gilles Kahn. Cambridge University Press, 2009. 



